On Effective Decomposition of Training Data Sets for Min-Max Modular Classifier

نویسندگان

  • Hai Zhao
  • Bao-Liang Lu
  • Kai-An Wang
چکیده

Our previous work shows that traditional randomization partition method for min-max modular (M) classifier can not ensure stable generalization accuracy when the number of two-class problems increases. To overcome this drawback, we consider how to effectively decompose the training data set for a two-class problem in this paper. We propose four basic clustering and anti-clustering strategies and their combinations for partitioning training data sets. These four basic strategies are hyperplane decomposition, K-means algorithm, anti-K-means algorithm, and scatter procedure. Our experimental results show that all the proposed clustering and anti-clustering strategies are superior to the traditional random partition method.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Task Decomposition Using Geometric Relation for Min-Max Modular SVMs

The min-max modular support vector machine (M-SVM) was proposed for dealing with large-scale pattern classification problems. M-SVM divides training data to several sub-sets, and combine them to a series of independent sub-problems, which can be learned in a parallel way. In this paper, we explore the use of the geometric relation among training data in task decomposition. The experimental resu...

متن کامل

Learning from imbalanced data sets with a Min-Max modular support vector machine

Imbalanced data sets have significantly unequal distributions between classes. This between-class imbalance causes conventional classification methods to favor majority classes, resulting in very low or even no detection of minority classes. A Min-Max modular support vector machine (M-SVM) approaches this problem by decomposing the training input sets of the majority classes into subsets of sim...

متن کامل

Applying Spectral Clustering Algorithm on Min-max Modular Support Vector Machine

Through task decomposition and module combination, min-max modular support vector machines (M-SVMs) can be successfully used for different pattern classification tasks. A effective task decomposition strategy has been proved that is very critical to the final classification results. Based on a spectral clustering strategy, M-SVMs can divide the training data set of the original problem into sev...

متن کامل

Prediction of Protein Subcellular Multi-localization by Using a Min-Max Modular Support Vector Machine

Prediction of protein subcellular location is an important issue in computational biology because it provides important clues for characterization of protein function. Currently, much effort has been dedicated to developing automatic prediction tools. However, most of them focus on mono-locational proteins. It should be noted that many proteins bear multi-locational characteristics, and they ca...

متن کامل

Thyroid disorder diagnosis based on Mamdani fuzzy inference system classifier

Introduction: Classification and prediction are two most important applications of statistical methods in the field of medicine. According to this note that the classical classification are provided due to the clinical symptom and  do not involve the use of specialized information and knowledge. Therefore, using a classifier that can combine all this information, is necessary. The aim of this s...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005